1. Document ID: US 6915241 B2

L18: Entry 1 of 28 File: USPT Jul 5, 2005

US-PAT-NO: 6915241

DOCUMENT-IDENTIFIER: US 6915241 B2

TITLE: Method for segmentation and identification of nonstationary time series
DATE-ISSUED: July 5, 2005

INVENTOR-INFORMATION:

NAME CITY STATE ZIP CODE COUNTRY

Kohlmorgen; Jens Berlin DE
Lemm; Steven Berlin DE

File: USPT
US-PAT-NO: 6915212
DATE-ISSUED: July 5, 2005
Kamps; Bill

L18: Entry 3 of 28 File: USPT Apr 26, 2005

US-PAT-NO: 6885968

DOCUMENT-IDENTIFIER: US 6885968 B2

TITLE: Vehicular exterior identification and monitoring system-agricultural product
distribution

DATE-ISSUED: April 26, 2005

INVENTOR-INFORMATION:

NAME CITY STATE ZIP CODE COUNTRY

Breed; David S. Boonton Township, Morris County NJ
DuVall; Wilbur E. Kimberling City MO
Johnson; Wendell C. Signal Hill CA

US-CL-CURRENT: 702/143; 702/142, 702/149, 702/159



4. Document ID: US 6850824 B2

L18: Entry 4 of 28 File: USPT Feb 1, 2005

US-PAT-NO: 6850824

DOCUMENT-IDENTIFIER: US 6850824 B2

TITLE: Method and apparatus for controlling a vehicular component
DATE-ISSUED: February 1, 2005
INVENTOR-INFORMATION:

NAME CITY STATE ZIP CODE COUNTRY

Breed; David S. Boonton Township, Morris County NJ

US-CL-CURRENT: 701/36; 701/29, 701/34

5. Document ID: US 6784379 B2

L18: Entry 5 of 28 File: USPT Aug 31, 2004

US-PAT-NO: 6784379

DOCUMENT-IDENTIFIER: US 6784379 B2

TITLE: Arrangement for obtaining information about an occupying item of a seat
DATE-ISSUED: August 31, 2004

Breed; David S. Boonton Township, Morris County
DuVall; Wilbur E. Kimberling City MO
Johnson; Wendell C. Signal Hill CA

DOCUMENT-IDENTIFIER: US 6782316 B2
DATE-ISSUED: August 24, 2004

INVENTOR-INFORMATION:

NAME CITY STATE ZIP CODE COUNTRY

Breed; David S. Boonton Township, Morris County NJ
DuVall; Wilbur E. Kimberling City MO
Johnson; Wendell C. Signal Hill CA

US-CL-CURRENT: 701/49; 180/273, 318/467, 701/45

L18: Entry 7 of 28 File: USPT Jun 29, 2004

US-PAT-NO: 6757602 

DOCUMENT-IDENTIFIER: US 6757602 B2

TITLE: System for determining the occupancy state of a seat in a vehicle and 
controlling a component based thereon 

DATE-ISSUED: June 29, 2004 

INVENTOR-INFORMATION:

NAME CITY STATE ZIP CODE COUNTRY 

Breed; David S. Boonton Township, Morris County NJ 

DuVall; Wilbur E. Kimberling City MO 

Johnson; Wendell C. Signal Hill CA 

Morin; Jeffrey L. Lincoln Park MI 

L18: Entry 8 of 28 File: USPT May 18, 2004
DOCUMENT-IDENTIFIER: US 6738697 B2

TITLE: Telematics system for vehicle diagnostics

DATE-ISSUED: May 18, 2004

INVENTOR-INFORMATION:

NAME CITY STATE ZIP CODE COUNTRY

Breed; David S. Boonton Township, Morris County NJ

US-CL-CURRENT: 701/29; 701/34

L18: Entry 9 of 28 File: USPT
US-PAT-NO: 6735506

DOCUMENT-IDENTIFIER: US 6735506 B2
TITLE: Telematics system
DATE-ISSUED: May 11, 2004

INVENTOR-INFORMATION:

NAME CITY STATE ZIP CODE COUNTRY

Breed; David S. Morris County NJ
DuVall; Wilbur E. Kimberling City MO
Johnson; Wendell C. Signal Hill CA

US-CL-CURRENT: 701/36

L18: Entry 10 of 28 File: USPT Feb 24, 2004

US-PAT-NO: 6697457

DOCUMENT-IDENTIFIER: US 6697457 B2

TITLE: Voice messaging system that organizes voice messages based on detected
emotion

DATE-ISSUED: February 24, 2004

INVENTOR-INFORMATION:

NAME CITY STATE ZIP CODE COUNTRY

Petrushin; Valery A. Buffalo Grove IL

US-CL-CURRENT: 379/88.08; 704/205, 704/209, 704/270

L18: Entry 11 of 28 File: USPT Dec 30, 2003

US-PAT-NO: 6671391

DOCUMENT-IDENTIFIER: US 6671391 B1

TITLE: Pose-adaptive face detection system and process
DATE-ISSUED: December 30, 2003

INVENTOR-INFORMATION:

NAME CITY STATE ZIP CODE COUNTRY

Zhang; Hong-Jiang Beijing CN
Yong; Ma Beijing CN

L18: Entry 12 of 28 Feb 25, 2003
DATE-ISSUED: February 25, 2003

INVENTOR-INFORMATION:

NAME CITY

Ornes; Chester Garden Grove

13. Document ID: US 6484080 B2

L18: Entry 13 of 28 File: USPT Nov 19, 2002

US-PAT-NO: 6484080

DOCUMENT-IDENTIFIER: US 6484080 B2

TITLE: Method and apparatus for controlling a vehicular component
DATE-ISSUED: November 19, 2002
INVENTOR-INFORMATION:

NAME CITY STATE ZIP CODE COUNTRY

Breed; David S. Boonton Township, Morris County NJ

US-CL-CURRENT: 701/36; 701/29, 701/34



14. Document ID: US 6480826 B2

L18: Entry 14 of 28 File: USPT Nov 12, 2002

US-PAT-NO: 6480826

DOCUMENT-IDENTIFIER: US 6480826 B2

** See image for Certificate of Correction **

TITLE: System and method for a telephonic emotion detection that provides operator
feedback

DATE-ISSUED: November 12, 2002
INVENTOR-INFORMATION:

NAME CITY STATE ZIP CODE COUNTRY

Pertrushin; Valery A. Arlington Heights IL

US-CL-CURRENT: 704/270; 704/273, 704/274, 704/275



15. Document ID: US 6463415 B2

L18: Entry 15 of 28 File: USPT Oct 8, 2002

US-PAT-NO: 6463415

DOCUMENT-IDENTIFIER: US 6463415 B2

** See image for Certificate of Correction **

TITLE: 69voice authentication system and method for regulating border crossing
DATE-ISSUED: October 8, 2002

INVENTOR-INFORMATION:

NAME CITY STATE ZIP CODE COUNTRY

St. John; Vicki Boulder Creek CA

US-CL-CURRENT: 704/273; 600/300, 600/407, 704/231, 704/270, 704/272, 704/275



16. Document ID: US 6456991 B1

L18: Entry 16 of 28 File: USPT Sep 24, 2002

US-PAT-NO: 6456991

DOCUMENT-IDENTIFIER: US 6456991 B1

TITLE: Classification method and apparatus based on boosting and pruning of
multiple classifiers

DATE-ISSUED: September 24, 2002

INVENTOR-INFORMATION:

NAME CITY STATE ZIP CODE COUNTRY

Srinivasa; Narayan Thousand Oaks CA
Owechko; Yuri Newbury Park CA

US-CL-CURRENT: 706/20; 704/9, 706/16, 706/25



17. Document ID: US 6445988 B1

L18: Entry 17 of 28 File: USPT Sep 3, 2002

US-PAT-NO: 6445988

DOCUMENT-IDENTIFIER: US 6445988 B1

TITLE: System for determining the occupancy state of a seat in a vehicle and
controlling a component based thereon

DATE-ISSUED: September 3, 2002

INVENTOR-INFORMATION:

NAME CITY STATE ZIP CODE COUNTRY

Breed; David S. Boonton Township, Morris County NJ
DuVall; Wilbur E. Kimberling City
Johnson; Wendell C. Signal Hill CA
Morin; Jeffrey L. Lincoln Park MI
Xu; Kunhong Rochester Hills
Kussul; Michael E. Kyiv

18. Document ID: US 6427137 B2

L18: Entry 18 of 28 File: USPT Jul 30, 2002

US-PAT-NO: 6427137

DOCUMENT-IDENTIFIER: US 6427137 B2

** See image for Certificate of Correction **

TITLE: System, method and article of manufacture for a voice analysis system that
detects nervousness for preventing fraud

DATE-ISSUED: July 30, 2002

INVENTOR-INFORMATION:

NAME CITY STATE ZIP CODE COUNTRY

Petrushin; Valery A. Arlington Heights IL

US-CL-CURRENT: 704/273; 704/209, 704/270, 704/272, 704/275



19. Document ID: US 6353810 B1

L18: Entry 19 of 28 File: USPT Mar 5, 2002

US-PAT-NO: 6353810

DOCUMENT-IDENTIFIER: US 6353810 B1

** See image for Certificate of Correction **

TITLE: System, method and article of manufacture for an emotion detection system
improving emotion recognition

DATE-ISSUED: March 5, 2002

INVENTOR-INFORMATION:

NAME CITY STATE ZIP CODE COUNTRY

Petrushin; Valery A. Arlington Heights IL

US-CL-CURRENT: 704/236; 704/246, 704/270



20. Document ID: US 6275806 B1

L18: Entry 20 of 28 File: USPT Aug 14, 2001

US-PAT-NO: 6275806

DOCUMENT-IDENTIFIER: US 6275806 B1

TITLE: System method and article of manufacture for detecting emotion in voice
signals by utilizing statistics for voice signal parameters

DATE-ISSUED: August 14, 2001

INVENTOR-INFORMATION:

NAME CITY STATE ZIP CODE COUNTRY

Pertrushin; Valery A. Arlington Heights IL

US-CL-CURRENT: 704/272; 704/270

21. Document ID: US 6151571 A

L18: Entry 21 of 28 File: USPT Nov 21, 2000

US-PAT-NO: 6151571

DOCUMENT-IDENTIFIER: US 6151571 A

TITLE: System, method and article of manufacture for detecting emotion in voice
signals through analysis of a plurality of voice signal parameters

DATE-ISSUED: November 21, 2000

INVENTOR-INFORMATION:

NAME CITY STATE ZIP CODE COUNTRY

Pertrushin; Valery A. Arlington Heights IL

US-CL-CURRENT: 704/209; 704/207, 704/270



22. Document ID: US 6135965 A

L18: Entry 22 of 28 File: USPT Oct 24, 2000

US-PAT-NO: 6135965

DOCUMENT-IDENTIFIER: US 6135965 A

TITLE: Spectroscopic detection of cervical pre-cancer using radial basis function
networks

DATE-ISSUED: October 24, 2000

INVENTOR-INFORMATION:

NAME CITY STATE ZIP CODE COUNTRY

Tumer; Kagan Sunnyvale CA
Ramanujam; Nirmala Philadelphia PA
Richards-Kortum; Rebecca Austin TX
Ghosh; Joydeep Austin TX

US-CL-CURRENT: 600/476; 128/925, 600/408

23. Document ID: US 6009199 A

L18: Entry 23 of 28 File: USPT Dec 28, 1999

US-PAT-NO: 6009199

DOCUMENT-IDENTIFIER: US 6009199 A

TITLE: Classification technique using random decision forests
DATE-ISSUED: December 28, 1999

INVENTOR-INFORMATION:

NAME CITY STATE ZIP CODE COUNTRY

Ho; Tin Kam Cedar Grove NJ

US-CL-CURRENT: 382/224; 382/226, 382/227



24. Document ID: US 5930392 A

L18: Entry 24 of 28 File: USPT Jul 27, 1999

US-PAT-NO: 5930392

DOCUMENT-IDENTIFIER: US 5930392 A

TITLE: Classification technique using random decision forests
DATE-ISSUED: July 27, 1999

INVENTOR-INFORMATION:

NAME CITY STATE ZIP CODE COUNTRY

Ho; Tin Kam Cedar Grove NJ

US-CL-CURRENT: 382/224; 382/226, 382/227



25. Document ID: US 5776063 A

L18: Entry 25 of 28 File: USPT Jul 7, 1998

US-PAT-NO: 5776063

DOCUMENT-IDENTIFIER: US 5776063 A
TITLE: Analysis of ultrasound images in the presence of contrast agent
DATE-ISSUED: July 7, 1998

INVENTOR-INFORMATION:

NAME CITY STATE ZIP CODE COUNTRY

Dittrich; Howard San Diego CA
Levene; Harold San Diego CA
Mjolsness; Eric Encinitas CA

US-CL-CURRENT: 600/408; 600/458



26. Document ID: US 5502688 A

L18: Entry 26 of 28 File: USPT Mar 26, 1996

US-PAT-NO: 5502688

DOCUMENT-IDENTIFIER: US 5502688 A

TITLE: Feedforward neural network system for the detection and characterization of
sonar signals with characteristic spectrogram textures

DATE-ISSUED: March 26, 1996

INVENTOR-INFORMATION:

NAME CITY STATE ZIP CODE COUNTRY

Recchione; Michael C. Nutley NJ
Russo; Anthony P. Bloomfield NJ

US-CL-CURRENT: 367/131; 367/135, 367/901



27. Document ID: EP 1378855 A2

L18: Entry 27 of 28 File: EPAB Jan 7, 2004

PUB-NO: EP001378855A2

DOCUMENT-IDENTIFIER: EP 1378855 A2

TITLE: Exploiting ensemble diversity for automatic feature extraction
PUBN-DATE: January 7, 2004

INVENTOR-INFORMATION:

NAME COUNTRY

YAO, XIN PROF DR GB
BROWN, GAVIN GB
SENDHOFF, BERNHARD DE
WERSING, HEIKO DE

INT-CL (IPC): G06K 9/62

EUR-CL (EPC): G06K009/62; G06K009/66



28. Document ID: IE 83594 B, WO 200261679 A2, IE 82434 B3, AU 2002230051 A1, EP
1417643 A2, US 20040093315 A1

L18: Entry 28 of 28 File: DWPI Sep 22, 2004

DERWENT-ACC-NO: 2002-548570
DERWENT-WEEK: 200462

COPYRIGHT 2005 DERWENT INFORMATION LTD

TITLE: Prediction model generation for neural network training, trains an ensemble
of neural networks, estimates a performance error value and uses it to train a
subsequent ensemble of neural networks in an iterative process

INVENTOR: CARNEY, J

PRIORITY-DATA: 2001IE-0000075 (January 31, 2001)

PATENT-FAMILY:
PUB-NO PUB-DATE
IE 83594 B September 22, 2004
WO 200261679 A2 August 8, 2002
IE 82434 B3 August 21, 2002
AU 2002230051 A1
EP 1417643 A2
US 20040093315 A1

The distributed boosting algorithm
Aleksandar Lazarevic, Zoran Obradovic 

August 2001 Proceedings of the seventh ACM SIGKDD international conference on 
Knowledge discovery and data mining 

Full text available: pdf(625.23 KB) Additional Information: full citation, abstract, references, index terms

In this paper, we propose a general framework for distributed boosting intended for 
efficient integrating specialized classifiers learned over very large and distributed 
homogeneous databases that cannot be merged at a single location. Our distributed 
boosting algorithm can also be used as a parallel classification technique, where a massive 
database that cannot fit into main computer memory is partitioned into disjoint subsets for 
a more efficient analysis. In the proposed method, at each boost ... 



Keywords: Boosting, classifier ensembles, distributed learning 



2 KDD-99 conference reports: KDD'99 competition: knowledge discovery contest

Jim Georges, Anne H. Milley 

January 2000 ACM SIGKDD Explorations Newsletter, volume 1 issue 2

Full text available: pdf(801.43 KB) Additional Information: full citation, abstract, references

In this paper, we expand on the 1998 KDD cup competition findings: exploratory data
analysis reveals unusual data anomalies; a two-stage prediction model yields results
superior to those obtained in the 1998 competition; we use a decision tree to better
understand the model (the decision boundary); and we apply a confidence interval to
establish a range upon which we can reasonably judge model performance.

Keywords: decision tree, model performance, neural network, two-stage prediction 

January 2002 ACM SIGKDD Explorations Newsletter volume 3 issue 2

Full text available: pdf(1.16 MB) Additional Information: full citation, abstract, references, citings

This paper describes an experimental mobile data mining system that allows intelligent
monitoring of time-critical financial data from a hand-held PDA. It presents the overall
system architecture and the philosophy behind the design. It explores one particular aspect
of the system: automated construction of personalized focus area that calls for user's
attention. This module works using data mining techniques. The paper describes the data
mining component of the system that employs a novel Four ...

http://portal.acm.org/resultsxfm?coll=ACM&dl=ACM&CFID=50538968& 7/20/2005

Results (page 1): "neural networks" ensemble and "predictive model"

4 Reports: Protein matching with custom neural network objective functions
David S. Vogel, Eric Gottschalk, Morgan C. Wang

December 2004 ACM SIGKDD Explorations Newsletter volume 6 issue 2
Full text available: pdf(155.45 KB) Additional Information: full citation, abstract, references

This 2004 KDD Cup presents a perfect case where the usual neural network objective 
functions do not apply. While the contest problem consisted of 4 different entries with 4 
different objective functions, this paper will focus on the solution optimizing GRMSE 
(Grouped Root Mean Squared Error). It will be shown that the more typical objective 
functions (including RMSE) cannot be as effective at meeting this criteria. While this 
objective function may be specific to this problem, and the reader may ... 

Keywords: KDD, KDD Cup, MITCH, classification, homology, machine learning, neural 
networks, objective functions, protein 

5 Special issue on learning from imbalanced datasets: Minority report in fraud detection:
classification of skewed data
Clifton Phua, Damminda Alahakoon, Vincent Lee
June 2004 ACM SIGKDD Explorations Newsletter, volume 6 issue 1

Full text available: pdf(262.38 KB) Additional Information: full citation, abstract, references, citings

This paper proposes an innovative fraud detection method, built upon existing fraud 
detection research and Minority Report, to deal with the data mining problem of skewed 
data distributions. This method uses backpropagation (BP), together with naive Bayesian 
(NB) and C4.5 algorithms, on data partitions derived from minority oversampling with 
replacement. Its originality lies in the use of a single meta-classifier (stacking) to choose 
the best base classifiers, and then combine these base ... 

Keywords: fraud detection, meta-learning, multiple classifier systems 

S. Bailey, R. Grossman, H. Sivakumar, A. Turinsky

January 1999 Proceedings of the 1999 ACM/IEEE conference on Supercomputing
(CDROM)

Full text available: pdf(538.23 KB) Additional Information: full citation, references, citings, index terms

Statistical methods I: MARK: a boosting algorithm for heterogeneous kernel models
Kristin P. Bennett, Michinari Momma, Mark J. Embrechts

July 2002 Proceedings of the eighth ACM SIGKDD international conference on
Knowledge discovery and data mining

Full text available: pdf(774.00 KB) Additional Information: full citation, abstract, references, citings, index

Support Vector Machines and other kernel methods have proven to be very effective for 
nonlinear inference. Practical issues are how to select the type of kernel including any 
parameters and how to deal with the computational issues caused by the fact that the 
kernel matrix grows quadratically with the data. Inspired by ensemble and boosting 
methods like MART, we propose the Multiple Additive Regression Kernels (MARK) algorithm
to address these issues. MARK considers a large (potentially infinite ... 

8 Genetic algorithms: Evolving neural network ensembles for control problems
David Pardoe, Michael Ryoo, Risto Miikkulainen 

June 2005 Proceedings of the 2005 conference on Genetic and evolutionary 
computation GECCO '05 

Full text available: pdf(121.49 KB) Additional Information: full citation, abstract, references, index terms

In neuroevolution, a genetic algorithm is used to evolve a neural network to perform a 
particular task. The standard approach is to evolve a population over a number of 
generations, and then select the final generation's champion as the end result. However, it 
is possible that there is valuable information present in the population that is not captured 
by the champion. The standard approach ignores all such information. One possible solution 
to this problem is to combine multiple individuals fro ...

Keywords: ensembles, genetic algorithms, neural networks, reinforcement learning 

9 Reports from KDD-2001: KDD Cup 2001 report

Jie Cheng, Christos Hatzis, Hisashi Hayashi, Mark-A. Krogel, Shinichi Morishita, David Page,
Jun Sese

January 2002 ACM SIGKDD Explorations Newsletter volume 3 issue 2

Full text available: pdf(1.96 MB) Additional Information: full citation, abstract, references, citings

This paper presents results and lessons from KDD Cup 2001. KDD Cup 2001 focused on 
mining biological databases. It involved three cutting-edge tasks related to drug design and 
genomics. 

Keywords: Competition, biology, drug design, genomics 



10 KDD-99 conference reports: KDD-cup 99: knowledge discovery in a charitable
organization's donor database
Saharon Rosset, Aron Inger

January 2000 ACM SIGKDD Explorations Newsletter volume 1 issue 2
Full text available: pdf(557.52 KB) Additional Information: full citation, citings



11 Data filtering for autom ... from reflectance spectra
Jonathan Moody, Ricardo Silva, Joseph Vanderwaart 

August 2001 Proceedings of the seventh ACM SIGKDD international conference on 
Knowledge discovery and data mining 

Full text available: pdf(597.99 KB) Additional Information: full citation, abstract, references, index terms

The ability to identify the mineral composition of rocks and soils is an important tool for the 
exploration of geological sites. For instance, NASA intends to design robots that are 
sufficiently autonomous to perform this task on planetary missions. Spectrometer readings 
provide one important source of data for identifying sites with minerals of interest. 
Reflectance spectrometers measure intensities of light reflected from surfaces over a range 
of wavelengths. Spectral intensity patterns may in ... 

12 Incorporating contextual information in recommender systems using a
multidimensional approach

Gediminas Adomavicius, Ramesh Sankaranarayanan, Shahana Sen, Alexander Tuzhilin

January 2005 ACM Transactions on Information Systems (TOIS), volume 23 issue 1
Full text available: pdf(423.91 KB) Additional Information: full citation, abstract, references, index terms

The article presents a multidimensional (MD) approach to recommender systems that can 
provide recommendations based on additional contextual information besides the typical 
information on users and items used in most of the current recommender systems. This 
approach supports multiple dimensions, profiling information, and hierarchical aggregation 
of recommendations. The article also presents a multidimensional rating estimation method 
capable of selecting two-dimensional segments of ratings pert ... 

Keywords: Recommender systems, collaborative filtering, context-aware recommender 
systems, multidimensional data models, multidimensional recommender systems, 
personalization, rating estimation 



13 Contributed articles: A critical review of multi-objective optimization in data mining: a
position paper
Alex A. Freitas

December 2004 ACM SIGKDD Explorations Newsletter volume 6 issue 2

Full text available: pdf(211.81 KB) Additional Information: full citation, abstract, references

This paper addresses the problem of how to evaluate the quality of a model built from the 
data in a multi-objective optimization scenario, where two or more quality criteria must be 
simultaneously optimized. A typical example is a scenario where one wants to maximize 
both the accuracy and the simplicity of a classification model or a candidate attribute subset 
in attribute selection. One reviews three very different approaches to cope with this 
problem, namely: (a) transforming the original mult ... 

Keywords: classification, lexicographic approach, multi-objective optimization, pareto 
dominance 



14 KDD-99 conference reports: Knowledge discovery in databases: 10 years after
Gregory Piatetsky-Shapiro

January 2000 ACM SIGKDD Explorations Newsletter volume 1 issue 2

Full text available: pdf(264.76 KB) Additional Information: full citation, abstract, references

In this paper, we describe the past 10 years of KDD and outline predictions for the next 10 
years. 

Keywords: KDD, data mining, history, knowledge discovery in databases 



15 Intrusion and privacy: Exploiting unlabeled data in ensemble methods
Kristin P. Bennett, Ayhan Demiriz, Richard Maclin 

July 2002 Proceedings of the eighth ACM SIGKDD international conference on 
Knowledge discovery and data mining 

Full text available: pdf(719.46 KB) Additional Information: full citation, abstract, references, index terms

An adaptive semi-supervised ensemble method, ASSEMBLE, is proposed that constructs 
classification ensembles based on both labeled and unlabeled data. ASSEMBLE alternates 
between assigning "pseudo-classes" to the unlabeled data using the existing ensemble and 
constructing the next base classifier using both the labeled and pseudolabeled data. 
Mathematically, this intuitive algorithm corresponds to maximizing the classification margin 
in hypothesis space as measured on both the labeled and unlabel ... 

Keywords: boosting, classification, ensemble learning, semi-supervised learning 

16 Learning Ensembles from Bites: A Scalable and Accurate Approach
Nitesh V. Chawla, Lawrence O. Hall, Kevin W. Bowyer, W. Philip Kegelmeyer

December 2004 The Journal of Machine Learning Research, volume 5

Full text available: pdf(3.34 MB) Additional Information: full citation, abstract, index terms

Bagging and boosting are two popular ensemble methods that typically achieve better 
accuracy than a single classifier. These techniques have limitations on massive data sets, 
because the size of the data set can be a bottleneck. Voting many classifiers built on small 
subsets of data ("pasting small votes") is a promising approach for learning from massive 
data sets, one that can utilize the power of boosting and bagging. We propose a framework 
for building hundreds or thousands of such classifie ... 

17 The principled design of large-scale recursive neural netw ...
and the protein structure prediction problem

Pierre Baldi, Gianluca Pollastri

December 2003 The Journal of Machine Learning Research, volume 4

Full text available: pdf(231.40 KB) Additional Information: full citation, abstract, references, index terms

We describe a general methodology for the design of large-scale recursive neural network 
architectures (DAG-RNNs) which comprises three fundamental steps: (1) representation of 
a given domain using suitable directed acyclic graphs (DAGs) to connect visible and hidden 
node variables; (2) parameterization of the relationship between each variable and its 
parent variables by feedforward neural networks; and (3) application of weight-sharing 
within appropriate subsets of DAG connections to capture s ... 

18 Cluster ensembles - a knowledge reuse framework for combining multiple partitions
Alexander Strehl, Joydeep Ghosh 

March 2003 The Journal of Machine Learning Research, volume 3 



This paper introduces the problem of combining multiple partitionings of a set of objects 
into a single consolidated clustering without accessing the features or algorithms that 
determined these partitionings. We first identify several application scenarios for the 
resultant 'knowledge reuse' framework that we call cluster ensembles. The cluster ensemble
problem is then formalized as a combinatorial optimization problem in terms of shared 
mutual information. In addition to a direct ... 

Keywords: cluster analysis, clustering, consensus functions, ensemble, knowledge reuse, 
multi-learner systems, mutual information, partitioning, unsupervised learning 

William J. E. Potts

August 1999 Proceedings of the fifth ACM SIGKDD international conference on

Full text available: pdf(628.73 KB) Additional Information: full citation, references, index terms

and parameter varying ensembles
Yan-Shi Dong, Ke-Song Han

March 2005 Proceedings of the 2005 ACM symposium on Applied computing

Full text available: pdf(231.21 KB) Additional Information: full citation, abstract, references, index terms

Support vector machines (SVM) are among the best text classifiers so far. Meantimes, 
ensembles of classifiers are proven to be effective on many domains. It is expected that 
ensembles of SVM classifiers could achieve better performance. In this paper two types of 
ensembles on SVM classifiers, the data partitioning ensembles and heterogeneous 
ensembles, have been proposed and experimentally evaluated on three well-accepted 
collections. Major conclusions are that disjunct partitioning ensembles wi ... 

Keywords: ensemble, support vector machines, text classification 